Symbol Sequence Search from Telephone Conversation

Authors

  • Masayuki Suzuki
  • Gakuto Kurata
  • Abhinav Sethy
  • Bhuvana Ramabhadran
  • Kenneth Ward Church
  • Mark Drake
Abstract

We propose a method for searching for symbol sequences in conversations. Symbol sequences include phone numbers, credit card numbers, and any kind of ticket (identification) number, and they are often communicated in call center conversations. Automatic extraction of these sequences from speech is key to many automatic speech recognition (ASR) applications such as question answering and summarization. Compared with spoken term detection (STD), symbol sequence search poses two additional problems. First, the entire symbol sequence is typically not observed continuously but in subsequences: customers or agents speak the sequence in fragments, while the recipient repeats each fragment to confirm that it was heard correctly. Second, we have to distinguish between different kinds of symbol sequences, for example, phone numbers versus ticket numbers or customer identification numbers. To deal with these problems, we propose to apply STD to symbol-sequence fragments and subsequently use confidence scoring to obtain the entire symbol sequence. For the confidence scoring, we propose a long short-term memory (LSTM) based approach that takes as input the words before and after each fragment. We also propose to detect repetitions of fragments and use them for confidence scoring. Our proposed method achieves an F-measure of 0.87 on an eight-digit customer identification number search task when operating at a word error rate (WER) of 20.3%.
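The idea of using repeated fragments to raise confidence and then assembling the full sequence can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract describes an LSTM-based scorer, whereas the code below substitutes a simple hand-tuned heuristic. The `Fragment` type, the function names, and the `window`, `boost`, and `threshold` values are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Fragment:
    start: float   # start time of the detected fragment, in seconds
    digits: str    # recognized digit string for this fragment
    score: float   # detector confidence in [0, 1] (assumed scale)


def merge_repetitions(fragments, window=5.0, boost=0.2):
    """Collapse a fragment that is repeated soon afterwards into one
    entry and raise its score (a stand-in for learned confidence scoring)."""
    merged = []
    for frag in sorted(fragments, key=lambda f: f.start):
        prev = merged[-1] if merged else None
        if prev and prev.digits == frag.digits and frag.start - prev.start <= window:
            # Same digits heard again shortly after: treat as a confirmation.
            prev.score = min(1.0, max(prev.score, frag.score) + boost)
        else:
            merged.append(Fragment(frag.start, frag.digits, frag.score))
    return merged


def assemble(fragments, target_len=8, threshold=0.5):
    """Concatenate confident fragments in time order; return the sequence
    only if it has the expected length, otherwise None."""
    kept = [f for f in merge_repetitions(fragments) if f.score >= threshold]
    candidate = "".join(f.digits for f in kept)
    return candidate if len(candidate) == target_len else None


# Hypothetical detections: the agent repeats each four-digit fragment.
detections = [
    Fragment(1.0, "1234", 0.45),
    Fragment(2.5, "1234", 0.40),
    Fragment(4.0, "5678", 0.70),
    Fragment(5.5, "5678", 0.60),
]
print(assemble(detections))
```

In this toy input, the first fragment falls below the threshold on its own and is kept only because it is repeated. One limitation of the heuristic is that an identifier that genuinely contains the same group twice in a row would be collapsed incorrectly, which is one reason a context-aware scorer is preferable.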

Similar resources

Telephone Conversation Closing Strategies Used by Persian Speakers: Rapport Management Approach

The use of politeness strategies can help interlocutors promote and/or maintain social harmony in telephone interactions. Using the Rapport Management Model proposed by Spencer-Oatey (2008), this study aimed primarily to reinvestigate the closing structures of telephone conversation (hereafter abbreviated as TC) in Persian and to discover the common politeness strategies used by native Persian ...

Telephone conversation impairs sustained visual attention via a central bottleneck.

Recent research has shown that holding telephone conversations disrupts one's driving ability. We asked whether this effect could be attributed to a visual attention impairment. In Experiment 1, participants conversed on a telephone or listened to a narrative while engaged in multiple object tracking (MOT), a task requiring sustained visual attention. We found that MOT was disrupted in the tele...

Telephone Conversation Closing Structure Across English and Persian

Due to the lack of paralinguistic information, politeness gains a considerable significance in telephone conversations (TCs). The use of politeness strategies can help interlocutors promote and/or maintain social harmony in telephone interactions. Using the Rapport Management Model proposed by Spencer-Oatey (2008), this study intended to primarily investigate the fundamental closing structures ...

Incremental diarization of telephone conversations

Speaker diarization systems attempt segmentation and labeling of a conversation between R speakers, while no prior information is given regarding the conversation. Most state of the art diarization systems require the full body of the conversation data prior to the application of some diarization approach. However, for some applications such as forensics, which handles vast amount of data, an o...

Publication date: 2017